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Abstract. In a recent work of the authors and Kim, we derived a com- 
plete description of the largest component of the Erdos-Renyi random 
graph Q{n,p) as it emerges from the critical window, i.e. forp = {l+e)/n 
where e^n oo and e — o(l), in terms of a tractable contiguous model. 
Here we provide the analogous description for the supercritical giant 
component, i.e. the largest component of Q{n,p) for p = X/n where 
A > 1 is fixed. The contiguous model is roughly as follows: Take a 
random degree sequence and sample a random multigraph with these 
degrees to arrive at the kernel; Replace the edges by paths whose lengths 
are i.i.d. geometric variables to arrive at the 2-core; Attach i.i.d. Poisson 
Galton- Watson trees to the vertices for the final giant component. As 
in the case of the emerging giant, we obtain this result via a sequence 
of contiguity arguments at the heart of which are Kim's Poisson-cloning 
method and the Pittel-Wormald local limit theorems. 



1. Introduction 

The famous phase transition of the Erdos and Renyi random graph, in- 
troduced in 1959 [13j, addresses the double jump in the size of the largest 
component Ci in Q{n,p) for p = X/n with A > fixed. When A < 1 it 
is logarithmic in size with high probability (w.h.p.), when A = 1 its size 
has order n^/^ and when A > 1 it is linear w.h.p. and thus referred to as 
the giant component. Of the above facts, the critical behavior was fully 
established only much later by Bollobaas [8j and Luczak [17J, and in fact 
extends throughout the critical window of p = {1 ± e)/n for e = 0{n~^/^) 
as discovered in [8]. 

As far as the structure of Ci is concerned, when p = X/n for fixed A < 1 
in fact this component is w.h.p a tree of a known (logarithmic) size. The 
structure and size of the largest components was established in and [2j , 
where in the latter work Aldous showed a remarkable connection between the 
critical random graph, continuum random trees and Brownian excursions. 
(See also the recent work [1] further studying the component structure at 
criticality, as well as [THH] for further details.) 

As opposed to the tree-like geometry at and below criticality, the struc- 
ture of the largest component becomes quite rich as soon as it emerges from 
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the critical window, i.e. at p = (1 + e)/n where s = o(l) and e^n — )• oo. De- 
spite many works devoted to the study of various properties of the largest 
component in this regime, the understanding of its structure remained fairly 
limited, illustrated by the fact that one of its most basic properties — the 
diameter — was determined asymptotically only lately in [lOJ and indepen- 
dently in j21j . In the context of our present work, out of the various decom- 
position results on the structure of Ci it is important to mention those by 
Luczak [18], highlighting the kernel as a random graph with a given degree 
sequence, and by Pittel and Wormald [20] , featuring very precise estimates 
on the distribution of the size of Ci and its 2-core (The 2-core of a graph is 
its maximum subgraph where all degrees are at least 2. The kernel of Ci is 
obtained from its 2-core by replacing every maximal path where all internal 
vertices have degree 2 by an edge.). 

Recently, the authors and Kim [9] established a complete characterization 
of the structure of Ci throughout the emerging supercritical regime, i.e. when 
p = {1 + e)/n with e^n — )• oo and e = o(l). This was achieved by offering 
a tractable contiguous model Ci, in other words, every graph property An 
that is satisfied by Ci w.h.p. (that is, a sequence of simple graphs such that 
P(Ci G An) — >■ 1) is also satisfied by Ci w.h.p. The contiguous model has 
a particularly simple description in the early stage of the formation of the 
giant, namely when p = (1 + e)/n with e^n — )• oo and e = o{n~^/^): 

(i) Sample a random 3-regular multigraph on 2[Z\ vertices via the con- 
figuration model, where Z is Guassian with parameters Af{^s^n,e^n). 

(ii) Subdivide each edge into a path of length i.i.d. Geometric(e). 

(iii) Attach i.i.d. Poisson(l — e)-Galton- Watson trees to each of the vertices. 
(In the above, a Poisson(^)-Galton- Watson tree is the family tree of a 
Galton- Watson branching process with offspring distribution Poisson(/i). 
See ^2.21 for the definition of the configuration model.) 

The advantages of the aforementioned characterization were demonstrated 
in two companion papers [lOpilj. The first of these settled the natural 
question of the asymptotic behavior of the diameter throughout the emerg- 
ing supercritical regim^, achieved by combining the structure result with a 
straightforward analysis of first-passage-percolation. The second established 
the order of the mixing time of the random walk on Ci, previously known 
only within the critical window and in the strictly supercritical regime, lack- 
ing the interpolating regime between them. See [9] for other applications of 
this result to easily read off key properties of Ci . 

"'^Prior to our work, [21] had independently and using a different method obtained the 
asymptotic diameter in most but not all of the emerging supercritical regime. Following 
our work they managed to close this gap. Note that the estimate there is quite precise, 
whereas our work only aimed to obtain the leading order term throughout the regime. 
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In this work we provide the analogous description for the strictly super- 
critical giant component, i.e. p = (1 + e)/n where e > is fixed. 

Theorem 1. Let Ci be the largest component ofQ{n,p) for p = X/n where 
X > 1 is fixed. Let fi < 1 be the conjugate of X, that is iigT^ = Ae~^. Then 
Ci is contiguous to the following model Ci : 

1. Let A he Gaussian J\f {X — fi,l/n) and let ~ Poisson(A) for u G [n] 
be i.i.d., conditioned that ^Dul£)^>3 is even. Let 

Nk = #{n : Du = k} and N = ^^^^ Nk . 

Select a random multigraph fC on N vertices, uniformly among all 
multigraphs with vertices of degree k for k > 3. 

2. Replace the edges of IC by paths of i.i.d. Geom(l — fj,) lengths. 

3. Attach an independent Foisson{fj,)-Galton-Watson tree to each vertex. 

That is, P(Ci £ A) ^ implies P(Ci £ A) ^ for any set of graphs A. 

(In the above, the notation Geom(l — fi) denotes the geometric variable 
assuming the value A; > 1 with probability — fi).) We note that 

conditioning that Du^Du>3 is even can easily be realized by rejection 
sampling. Alternatively, this requirement can be replaced by adding a self- 
loop (counting 1 to the degree) to one of the vertices whenever the sum 
is odd. Further note that in the above recipe for Ci, Step [T] constructs the 
kernel. Step [2] constructs the 2-core and finally the entire giant is constructed 
in the Step [3l 

To demonstrate how one can easily derive nontrivial properties of Ci from 
the above theorem, observe for instance that one can immediately infer that 
the longest path of degree-2 vertices in the 2-core is of size log^/^ n + Op{l). 
Indeed, there are order n edges in the kernel, hence the above quantity is 
simply the maximum of order n i.i.d. geometric variables with mean 1 — fi. 

As another example, we note that it was well-known prior to this work 
that the giant component consists of an expander "decorated" using paths 
and trees of at most logarithmic size (see [5] for a concrete example of such 
a statement, used there to obtain the order of the mixing time on the fully 
supercritical Ci). This is immediately apparent from the above description 
of Ci: indeed, it straightforward to show that the kernel is typically an ex- 
pander (see, e.g., ^ Lemma 3.5] where this was shown for the kernel in the 
emerging supercritical regime). The decorations spoiling its expansion as 
described in the decomposition results a la [5] are due to (i) edges subdi- 
vided into arbitrarily large paths via the geometric variables (ii) attached 
Poisson Galton- Watson trees of arbitrarily large size. In both cases, the size 
of the decoration is constant in expectation (depending on A) and has an 
exponential tail, reproducing the above depicted picture. 
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1.1. Main techniques and comparison with [9J. Our framework for 
obtaining the description of the largest component in Q{n,p), following the 
framework of [9], consists of three main contiguity arguments. Our starting 
point is the Poisson cloning model Gpc{n,p) due to [E], which is contiguous 
to G{n,p) (see The first step is to reduce the 2-core of ^pc(n,p) to a 
random graph with a given (random) degree sequence (Theorem 13.21 in 
The second step reduces this to a model where a kernel is expanded to a 
2-core by subdividing its edges via i.i.d. geometric variables (Theorem 14.21 
in The final step handles the attached trees and completes the proof of 
the main theorem (©. 

It is already at the first step where the analysis of our previous work [9] 
breaks when p = X/n for fixed A > 1. Our original approach at this stage 
relied on showing that a certain stopping time Tpc for a process that pro- 
duces the 2-core (the so-called COLA algorithm due to Kim) is absolutely 
continuous w.r.t. Lebesgue measure (after normalizing it by its standard de- 
viation). However, crucial in that proof was the fact that the p = {1 + e)/n 
and e = o(l), e.g. illustrated by the fact that the size of the 2-core given 
the aforementioned Tpc has a standard deviation smaller than the mean by 
a factor of ^/£, and as such is concentrated when e — )• 0. New arguments 
were required to establish Theorem 13. 2^ including the use of the powerful 
Pittel-Wormald [20] local limit theorems already in this stage of the proof 
(cf. [9] where this tool was applied only in the second stage of the reduc- 
tions). Finally, various arguments were simplified, either in places where the 
dependency in e would no longer play a role or in situations where the fact 
that 1/e = 0(1) allows direct application of standard local limit theorems. 

2. Preliminaries 

2.1. Cores and kernels. The k-core of a graph G, denoted by G^^\ is its 
maximum subgraph H C G where every vertex has degree at least k. This 
subgraph is unique, and can be obtained by repeatedly deleting any vertex 
whose degree is smaller than k (in an arbitrary order). The kernel /C of G is 
obtained by taking its 2-core G^^^ minus its disjoint cycles, then repeatedly 
contracting any path where all internal vertices have degree-2 (replacing it 
by a single edge). Notice that, by definition, the degree of every vertex in /C 
is at least 3. At certain times the notation ker(G) will be useful to denote 
a kernel with respect to some specific graph G. Note that ker(G) is usually 
different from the 3-core of G. 

2.2. Configuration model. This model, introduced by Bollobas [6j, pro- 
vides a remarkable method for constructing random graphs with a given 
degree distribution, which is highly useful to their analysis. We describe 
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this for the case of random d-regular graphs for d fixed (the model is similar 
for other degree distributions); see [71 ll41 [22] for additional information. 

Associate each of the n vertices with d distinct points (also referred to 
as "half-edges"), and consider a uniform perfect matching on these points. 
The random d-regular graph is obtained by contracting each cluster of the 
d points corresponding to a vertex, possibly introducing multiple edges and 
self-loops. Clearly, on the event that the obtained graph is simple, it is 
uniformly distributed among all d-regular graphs, and furthermore, one can 
show that this event occurs with probability bounded away from (namely, 
with probability about exp(^-^)). Hence, every event that occurs w.h.p. 
for this model, also occurs w.h.p. for a random d-regular graph. 

2.3. Poisson cloning. Following is a brief account on the Poisson cloning 
model Gpc{n,p), introduced in [15iil6j. Let V be the set of n vertices, and 
Po(A) denote a Poisson random variable with mean A. Let {d{v)}v^v be a 
sequence of i.i.d. Po(A) variables with A = (n — l)p. Then, take d{v) copies 
of each vertex v €z V and the copies of v are called clones of v or simply 
v-clones. Define Nx = Z^^gy ^^(^^)- If -^A is even, the multi-graph Qpc{n,p) 
is obtained by generating a uniform random perfect matching of those Nx 
clones (e.g., via the configuration model, where every clone is considered to 
be a half-edge) and contracting clones of the same vertex. That is to say, 
each matching of a w-clone and a w-clone is translated into the edge {v, w) 
with multiplicity. In the case that v = w, it contributes a self- loop with 
degree 2. On the other hand, if Nx is odd, we first pick a uniform clone and 
translate it to a special self-loop contributing degree 1 of the corresponding 
vertex (this special self-loop plays no important role in the model and can 
be neglected throughout the paper). For the remaining clones, generate a 
perfect matching and contract them as in the Nx even case. 

The following theorem of [16] states that the Poisson cloning model is 
contiguous with the Erdos-Renyi model. Hence, it suffices to study Poisson 
cloning model in order to establish properties of the Erdos-Renyi model. 

Theorem 2.1 ( [16^ Theorem 1.1]). Suppose p x 1/n. Then there exist 
constants ci,C2 > such that for any collection T of simple graphs, we have 

ciP(gpc(n,p) G J-) < F{g{n,p) € T) < C2{nGpc{n,p) G -F))'/' +e-") . 

Note that as p = A/n for A > 1 fixed we may clearly replace the rate 
A = (n — l)p in the Poisson-cloning model definition simply by A = np. 

2.4. Local limit theorem. Throughout the proofs we will need to establish 
local limit theorems for various parameters in the graph. To this end, we 
will repeatedly apply the following special case of a result in jl2] . 
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Theorem 2.2 ( |12l Ch. 2, Theorem 5.2], reformulated). Let X he a random 
variable on N with P(X = A;) > for all A; G N. Suppose that EX = < oo 
and Var X = 0"^ < oo. Let Xi be i.i.d distributed as X and Sm = X^i^i ^i- 
Then as m ^ oo, we have 



By Theorem 12.11 the random graph Q{n,p) in our range of parameters is 
contiguous to the Poisson doning model, where every vertex gets an i.i.d. 
Po(A) number of half-edges (clones) and the final multigraph is obtained 
via the configuration model. In this section we will reduce the 2-core of the 
supercritical Poisson cloning model to a more tractable model — a random 
graph uniformly chosen over all graphs with a given degree sequence. 

Definition 3.1 (Poisson-configuration model with parameters n and A). 
(1) Let A ~ A/'(A — ;U, 1/n), consider n vertices and assign an independent 
variable Du ~ Po(A) to each vertex u. Let = #{n : Du = k} and 



(2) Construct a random multigraph on N vertices, uniformly chosen over 
all graphs with Nk degree-k vertices for k > 2 (if N is odd, choose a 
vertex u with = k>2 with probability proportional to k, and give it 
k — 1 half-edges and a self-loop). 

Theorem 3.2. Let G ~ ^pc(n,p) be generated by the Poisson cloning model 
for p = \/n, where A > 1 is fixed. Let G^"^^ be its 2-core, and H be generated 
by the Poisson-configuration model corresponding to n,p. Then for any set 
of graphs A such that F{H £ A) ^ 0, we have P(G(2) ^ ^) ^ q_ 

To prove the above Theorem 13.21 we outline a specific way to generate 
Qpc{n,p) due to |16j . Let V he a set of n vertices and consider n horizontal 
line segments ranging from (0,j) to (A,j), for j = 1, . . . ,n in M? . Assign a 
Poisson point process with rate 1 on each line segment independently. Each 
point (x, v) in these processes is referred to as a t;-clone with the assigned 
number x. The entire set of Poisson point processes is called a Poisson A-cell. 

Given the Poisson A-cell, various schemes can be used to generate a perfect 
matching on all points. One such way is the "Cut-Off Line Algorithm" 
(COLA), defined in |15j . which is useful in finding the 2-core G^'^\ described 
as follows. The algorithm maintains the position of a "cut-off line" , a vertical 
line in M? whose initial x-coordinate equals A, and gradually moves leftwards. 
In the beginning the line is positioned at A, and as the line progresses it 




where C 



3. The 2-core of Poisson cloning 
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matches previously unmatched clones. To describe the stopping rule of 
the algorithm, we need the following definitions. At any given point, we 
call a vertex v G V (and its unmatched clones) light if it has at most one 
unmatched clone (and heavy otherwise). At the beginning of the process, 
all the light clones are placed in a stack. The order by which these clones 
are inserted into the stack can be arbitrary, as long as it is oblivious of the 
coordinates assigned to the clones. Define Tpc to be the x-coordinate of the 
cut-off line once the algorithm terminates, i.e., at the first time when there 
are no light clones. We will argue that Tpc is concentrated about X — fi with 
a standard deviation of l/\/n, yet before doing so we explain its role in 
determining the structure of the 2-core of the graph. The above algorithm 
repeatedly matches light clones until all of them are exhausted — precisely 
as the cut-off line reaches Tpc- As stated in ^ the 2-core of a graph can 
be obtained by repeatedly removing vertices of degree at most 1 (at any 
arbitrary order), thus it is precisely comprised of all the unmatched clones 
at the moment we reach Tpc- 

Algorithm 1 Cut-Off Line Algorithm 

1. Let (x, u) be the first clone in the stack. Move the cut-off line leftwards 
until it hits an unmatched clone {y,v) ^ {x,u). 

2. Remove {x,u) from the stack, as well as {y,v) (if it is there). 

3. Match {x,u) and {y,v) and re-evaluate u and v as light/heavy. 

4. Add any clone that just became light into the stack. 

5. If the stack is nonempty return to Step [3l otherwise quit and denote 
the stopping time by Tpc (final x-coordinate of cut-off line) . 



The following theorem establishes concentration for Tpc- Its proof will 

(2) 

follow from known estimates on the concentration of \C\ \ in Q{n,p). 

Theorem 3.3 (Upper bound on the window of Tpc). There exist constants 
C,c> so that for all ^ > with 7 = o(-v/n), the following holds: 

P(|rpc-(A-/x)| >7/V^) <Ce-^^'. (3.1) 

Proof. It is well known that in the super-critical random graph Q{n,X/n) 

with A > 1 fixed w.h.p. all components except one (the giant) are trees or 

unicyclic (see e.g. [HI Theorem 5.12]) and in addition the total expected 

number of vertices which belong to unicyclic components is bounded (see 

e.g. [3 Theorem 5.23]). In particular this implies that the 2-core of G{ti, A/n) 

(2) 

for A > 1 fixed consists of C) , the 2-core of the giant component, plus 
disjoint cycles whose total number of vertices is w.h.p. at most, say, 0(log n). 
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A special case of a powerful result of Pittel and Wormald [20j (the full 
statement of this theorem appears later as Theorem I5.ip implies that 

E|cf)| = (l-M)(l-> 

(2) (2) 

and in addition (|C| | — E|C| |)/ -^/n is in the limit Gaussian with variance of 
0(1). Combining these facts, there exists some fixed c > so that a random 
graph F ~ Q{n,p) in our regime has a 2-core F^'^^ whose size satisfies 

P - (1 - - !i.)n\\ > -f/Vnj < eicp{-c-f^) . 

By Theorem 12. II it then follows that for G ~ Qpc{n,p), 

P - (1 - ^)(1 - B.)n\\ > < Cexp(-C72) , 

where C = 1/ci from that theorem. 

To conclude the proof, observe on the event Tpc = x, the size of IS 
binomial with parameters Bin(n,p^(x)) where 

k>2 

It is easy to verify that x = A — /u is the unique positive solution of P2^(2;) = 
(1 — — j). This function further has -^pti^) = xe~^ thus its derivative 
is uniformly bounded away from in the interval [^(A — ^),2(A — fx)]. In 
particular, shifting Tpc by 'j/\/n would shift the mean of the above variable 
by order ^/\/n and the desired result follows. ■ 

The above theorem established the concentration of Tpc and as such re- 
duced the Poisson-cloning model to the Poisson-configuration model given 
the event Tpc = A — /i + o(l). With this in mind, the argument in the proof 
above stating that the disjoint cycles outside the giant component have 
bounded expectation in Q{n,p), along with the contiguity between Q{n,p) 
and Poisson-cloning, now immediately yield the following: 

Corollary 3.4. Let H be generated by the Poisson-configuration model given 
A = £, where £ = X — fi + o(l). Define H' as the graph obtained by deleting 
every disjoint cycle from H . Let N2 he the number of vertices with degree 2 
in H, and N2 be the corresponding quantity for H' . Then N2 = A^2 + 0p(l)• 
3.1. Contiguity of Poisson-cloning and Poisson-configuration. A key 
part of showing the contiguity result is the following lemma which controls 
the edge distribution in the Poisson-configuration model. 

Lemma 3.5. Let Nj. denote the number of degree-k vertices in the Poisson- 
configuration model, and set Aq = X — fi. For any fixed M > there exist 
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some ci,C2 > such that the following holds: If n^jU^, . . . satisfy 



„(l_e-Ao(i + Ao + f))-Efc>3^fe 
nAo (1 - e-^o (1 + Ao)) - Zk>3 

and X satisfies |a; — Ao| < Mj ^fn then 

P (iVfc = nfc /or a// A; > 3 I A = 



ci < 



< 

\ 

<C2. 



' (iVfe = njfc for allk>?,\k = Aq) 

Proof. Throughout the proof of the lemma, the imphcit constants in the 
O(-) notation depend on M. Write m = X]a;>3 ^fc ~ J2k>3^''^k, and 

let ^4 = A{n3, n^, . . .) denote the event {N^ = nj. for all k > 3}. Setting 

„ „ . P (A I A = x) 

" = "("'^°^ = F(^|A = Ao) ' 
we are interested in uniform bounds for H from above and below. Let 

p,^{x) = P(Po(a;) = k)= e'^'x^/kS , and p^{x) = P(Po(a;) < k) . 

and observe that 



P2 (^) 
^2(^0) 



n 



Pk[x) 



= g-n(x-Ao) 



1 + Ao + ^ 



X 



and so 



log; 



n(Ao — x) + (n — m) log 



1 + Ao + ^ 



+ r log — . 
Ao 



Using Taylor's expansion and recalling that x — Aq = 0(l/y^), 



log 



1 + X+ ^ 

1 + Ao + ^ 



^^^\, (x - Ao) + 0(l/n) , 
1 + Ao + ^ 



and we deduce that 

log E = n(Ao — x) + (n — m) - 



1 + Ao 



(x-Ao)+r^-^ + 0(l) 



A, 







1 + Ao + ^ 
Our assumptions on m, r now yield that 

logs = n(Ao - x) + ne-^°{l + Ao)(x - Aq) 

+ n (1 - e-^o (1 + Ao)) (x - Aq) + 0(1) = 0(1) , 
completing the proof. ■ 

Using the above estimate we are now able to conclude the main result 
of this section, which reduces the 2-core of Poisson-cloning to the graph 
generated by the Poisson-configuration model. 
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Proof of Theorem 13.21 Recall that H is the random graph generated by 
the Poisson-configuration model, and G^^-* is the 2-core of Poisson-cloning. 

Fix 6 > 0, and with the statement of Theorem 13.31 in mind, as well as the 
definition of A as Gaussian with parameters (A — /x, 1/n), set 

B = {X-i_i- M/Vn, X-IJ + Af/Vn) , 

where M = M(5) is a sufficiently large constant such that 

P(rpc eB,AGB)>l-6. 

Following the notation of Lemma 13.51 let A^^ be the number of degree-A; 
vertices in H. Conditioned on A = x we have that Ylk>3 = Ylu ^{Du>3} 
where the Du are i.i.d. Po(x), hence the Central Limit Theorem implies that 
J2k>3 is concentrated around n(l — e~^(l + x + ^)) with a window of 
0{^/n). A similar statement holds for J2k>3^^k = X^m -^«1{D„>3}- Since 
A is Gaussian with variance 1/n, removing the conditioning on A introduces 
0(1/ y/n) fluctuations which translate to 0{^/n) fluctuations in the above 
mentioned random variables. Altogether, if M{d) > is large enough then 
each of the variables J2k>3 ^k and J2k>3 ^-^k is within M^/n of its mean, 
except with probability b. 

Let F denote the set of all sequences : > 3} which satisfy the 
assumptions of Lemma 13.51 By the above discussion, we can restrict our 
attention to degree-sequences in F at a cost of events whose probability 
is at most b. By the conclusion of that lemma, the probability to have 
\Nk : A; > 3} G F given A = a; for some x G i? is uniformly bounded below 
and above by the corresponding probability given A = A — In particular, 
for any x ^ B the event that is in ^ n {{N^ : A; > 3} G F| given A = x 
has up to constants (that depend on b) the same probability for any x in 
this region. 

Further fix 5' > and define 

D = {x € B -.FiH £ A, {Nk{H) : k > 3} e T \ A = x) > b'} . 

We claim that for any large enough n we have D = 0. Indeed, this follows 
immediately from the discussion above, since the existence of some x & D 
would imply that in particular 

¥{H £A, {Nk{H) :fc>3}GF|AG5)> b'/c{b) , 

which, since P(A e B)>l-b, contradicts the fact that F{H e A) = o(l). 

With D = 0, reiterating the argument on the uniformity for on x G -B of 
the above probability given A = x we deduce that for any x G B, 



F{H eA, {NkiH) : A; > 3} G F I A = x) < y . 
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To complete the proof, observe that the Poisson-configuration model given 
A = X is equivalent to the Poisson-cloning model given Tpc = x. Therefore, 
combining the arguments thus far we arrive at the following estimate, valid 
for any x € B: 



where the 5-term accounted for the probability of {A^fc(if) : k > 3} ^ T. 
The proof is concluded by recalling that Tpc G B except with probability 5, 



4. Constructing the 2-core of the random graph 

So far we have reduced the 2-core of Poisson-cloning to the simpler Poisson- 
configuration model as given in Definition 13. 1[ In this section we will reduce 
the Poisson-configuration model to the Poisson-geometric model, defined as 
follows. Recall that /i < 1 is the conjugate of A > 1 as defined in Theorem[TJ 

Definition 4.1 (Poisson-geometric model for n and p = X/n). 

(1) Let Kr^ M{\- n, and assign an independent Po(A) variable to 
each vertex u. Let = i^{u : = and N = X]fc>3 ^k- 

(2) Construct a random multigraph fC on N vertices, uniformly chosen over 
all graphs with degree-k vertices for k > 3 (if Ylik>'i^-^k is odd, 
choose a vertex u with Du = k > 3 with probability proportional to k, 
and give it k — 1 half-edges and a self-loop). 

(3) Replace the edges of fC by paths of length i.i.d. Geom(l — //). 

Theorem 4.2. Let H be generated by the Poisson-configuration model w.r.t. 
n and p = \/n where \ > 1 is fixed. Let H be generated by the Poisson- 
geometric model corresponding to n,p. Then for any set of graphs A such 
that F{H £A)^0, we have F{H £A)^0. 

Both models clearly share the same kernel and only differ in the way this 
kernel is then expanded to form the entire graph (replacing edges by paths). 
To prove the above theorem we need to estimate the distribution of the total 
number of edges in each model and show they are in fact contiguous. A first 
step towards this goal is to control the number of edges in each model. Fix 
some large M > 0, and let Bm denote the following set of "good" kernels: 



where Aq = A — /i. The next lemma estimates the number of edges in the 
Poisson configuration model given that the kernel belongs to Bm as above. 



P(g(2) eA\Tpc = x)=F{HeA\A = x)<6' + 6 





(4.1) 
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Lemma 4.3. Define M > 0, Im,I3m o.s above and set Aq = A — /i. Let H be 

generated by the Poisson- configuration model. There exists some constant 
c{M) > so that for any IC G Bm and s with |s-| (Aq - e^^oAo) | < My/n, 



P [\E{H)\ = s , A e /m I ker(F) = /C) < . 

yn 

Proof. Let x € Im and K. G ;Sm, and write m = |/C| and r = |-E(/C)| for the 
number of vertices and the edges in the kernel respectively. We will first 
estimate ¥[\E{H)\ = s | A = x, ker(ff) = /C), and the required inequality 
will then readily follow from an integration over x G Im- 

Note that, given A = x and ker(i^) = /C, the number of edges in H is the 
r edges of /C plus an added edge for each degree 2 variable out of the n — m 
variables (i.i.d. Po(x)) that have {u : Du < 2}. That is, in this case 

/ 

|E'(F)| ~ r + Bin n - m 



l + x + x2/2^ 

It is straightforward to verify that for x £ Im and /C G Bm, we have 
K{\E{H)\\A = x,keT{H)=lC)=r + {n ^^^^ 



1 + X + x2/2 
^ (Ao - e-^«Ao) + OiV^) = s + OiV^) . 
The required estimate now follows immediately from Theorem 12. 2[ ■ 
We now turn to analyze the number of edges in Poisson-Geometric model. 



Lemma 4.4. Let M > and Bm be as in (I4.ip . Let H be generated by the 
Poisson-geometric model. There exists some constant c = c{M) > so that 
for any K. G Bm and s with \s — ^{X — fx) [l — j) \ < M^/n, 

E{H)\ = s\ ker(F) =/c) > ^ 



n 



Proof. By definition, given that kei{H) = IC, the variable \E{H)\ is the sum 
of |i?(/C)| i.i.d. geometric variables with mean 1/(1 — /i). 

Denote by r the number of edges in the kernel fC, and let s be a candidate 
for the number of edges in the expanded 2-core H. As stated in the lemma 
(recall definition ()4.ip ). we are interested in the following range for r, s: 
n 

r = -(A - fi)il - ^)(1 + ciV^, (|ci| < M) , 

Tl 

s = -{X - - l^) + C2V^, {\C2\<M). 

It is clear that given ker(lf) = IC with |£'(/C)| = r, the number of edges in H 
is distributed as Yll=i -^i, where the Xj's are independent geometric random 
variables with mean i-e., P(^i = k) = fi^~^{l—fi) for k = 1,2,.... Since 
s = + 0{y/r), the desired estimate now follows from Theorem 
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We now establish the main result of this section, Theorem 14.2^ reducing 
the Poisson-configuration model to the Poisson-geometric model. 

Proof of Theorem 14. 2[ For some constant M > to be specified later, 
define the event 

Am = { a G Im, kei{H) e Bm, \\E{H)\ - §(A - ^)(1 - ^)| < M^^ } . 

Fix 5 > 0. We claim that for a sufficiently large M = M{6) we have 
F{Am) > 1 — S. To see this, note the following: 

1. In the Poisson-configuration model, A ~ (A — /x, ^) , and Im includes 
at least M standard deviations about its mean. 

2. Each of the variables |/C| and E{IC) is a sum of i.i.d. random variables 
with variance 0{n) and mean as specified in the definition Bm, hence 
their concentration follows from the CLT. 

3. Finally, E{H) is again a sum of i.i.d. variables and has variance 0{n), 
only here we must subtract the vertices that comprise disjoint cycles. 
By Corollary [331 the number of such vertices is Op(l), which is negli- 
gible compared to the 0{^/n) standard deviation of E{H). 

Given an integer s and a kernel /C, let Ps,/c denote every possible 2-core with 
s edges and kernel /C. Crucially, the distribution of the Poisson-configuration 
model given E{H) = s and kev{H) = /C is uniform over 'Dg^ic, and so is the 
Poisson-geometric model given E{H) = s and ker(i?) = /C. Therefore, for 
any graph D gVs^ic, 



F{H = D 


ker{H) 


= /C) I 


^{\E{H)\ =s \ ker{H) 


= /C) 


F{H = D 


ker(#) 


= /C) I 


''i\E{H)\ = s ker(^) 


= /C) 



Combining Lemmas 14.31 and 14.41 we get that for some c = c{M) > 0, 



Fi\EiH)\=s , Am\ 


ker(F) = /C) 


F{\E{H)\ =s\ 


keiiH) = fC) 



Recalling that ¥{Am) > 1 — 6 and letting 5 — t- 0, we deduce that for any 
family of graphs A, if F{H £ A) ^ then also F{H £ A) ^ 0. U 

5. Constructing the giant component 

We begin by analyzing the trees that are attached to the 2-core of 
G. As before, ^ < 1 is the conjugate of A > 1 as defined in Theorem [TJ 

Proof of Theorem [1]. Write PGW(;u)-tree for a Poisson(;u)-Galton- Watson 
tree, for brevity. Let Ci denote the graph obtained as follows: 

(2) 

• Let H he a copy of C| (the 2-core of the giant component of G) . 

• For each v £ H, attach an independent PGW(//) tree rooted at v. 



14 



JIAN DING, EYAL LUBETZKY AND YUVAL PERES 



By this definition, Ci and Ci share the same 2-core H. For simphcity, we 
will refer directly to H as the 2-core of the model, whenever the context of 
either Ci or Ci is clear. We first establish the contiguity of Ci and Ci. 
Define the trees decorating the 2-core of Ci as follows: 



Clearly, each r„ is a tree as it is connected and has no cycles (its vertices 
were not included in the 2-core). We go from H to Ci by attaching the tree 
Tu to each vertex u G H (while identifying the root of with u) . Similarly, 



We next introduce notations for the labeled and unlabeled trees as well 
as their distributions. For t G N, let TZt be the set of all labeled rooted 
trees on the vertex set [t] = {1, . . . and let Ut be chosen uniformly at 
random from TZt- For T €z TZt and a bijection (/) on [t], let (/>(T) be the tree 
obtained by relabeling the vertices in T according to (p. Further let T' be the 
corresponding rooted unlabeled tree: T' = {(/){T) : is a bijection on [t]}. 

Let {tu : u £ H} be some integers. Given that { \Tu\ = tu for all u G H}, 
we know by definition of Q(n,p) that is independently and uniformly 
distributed among all labeled trees of size tu rooted at u. In particular, 
given this event each is independently distributed as C//^ (the unlabeled 
counterparts of and Ut^). On the other hand, Aldous [3] (see [1]) observed 
that if T is a PGW-tree then T' has the same distribution as f7/ on the event 
{\T\ = t}. Thus, conditioned on the event { \Tu\ = tu for all u £ H} we also 
get that has the same distribution as [//^ . 

We now turn to the sizes of the attached trees in Ci and Ci. Letting 
{tu : u G H} be some integers and writing N = YlueH , we claim that 
by definition of Q{n,p) every extension of the 2-core H to the component 
Ci, using trees whose sizes sum up to A^, has the same probability. To see 
this argue as follows. Fix H and notice that the probability of obtaining a 
component with a 2-core is H and an extension X connecting it to — \H\ 
additional vertices only depends on the number of edges in H and X (and 
the fact that this is a legal configuration, i.e., H is a valid 2-core and X is 
comprised of trees). Therefore, upon conditioning on H the probabilities of 
the various extensions X remain all equal. Cayley's formula gives that there 
are m^~^ labeled rooted trees on m vertices, and so. 



Tu = {v £ Ci : V is connected to u in Ci \ H} for u £ H . 



let {Tu}u€H be the analogous trees in Ci. 



(I 



Tu\ = tu for allueH\H)=F {\Ci\ = N \ H) 



1 N\ 



1 



p 



ueH 



1 -w—r rf*"~^ 1 



(5.1) 
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where Z(N) and Z'{N) are the following normalizing constants 

z'{N) = E n [^(^«"')" 

Z{N) = Z'(iV)/i^-l^le-'^^. 

It is well-known that the size of a Poisson(7)-Galton- Watson tree T follows 
a Borel(7) distribution, namely 

P(|r|=t) = ^(7e-^)*. (5.2) 

Recalling that are independent PGW(^)-trees, it follows that 

Combining this with (15. ip and (15. 2p . we obtain that 

— AT \ U\ 

(5.3) 



F(|r,| 


= tu for all ti 


G H 


^) 


_ n\ci\ 


= n\h) 


iP(l^«l 


= tu for all u 






n\ci\ 


= N\H) 



At this point, we wish to estimate the ratio in the right hand side above. 
To this end, we need the following result of |20] . 



/fei(A)\ 

Theorem 5.1 ([201 Theorem 6], reformulated). Let b(A) = I b2(\) \ where 

6i(A) = (l-M)(l-^) , 62(A) =/.(!- ^) , 63(A) = i(l-^) (A + ^.-2). 

There exist positive definite matrices Kp{X) and Km{X) such that 

(i) {\H\,\Ci\ — \H\,\E{H)\ — \H\) is in the limit Gaussian with a mean 

vector nh and a covariance matrix uKp. 
(a) If Am = and B denotes the event that \E{G)\ = m for some 

m = (1 + o(l))An/2, and there is a unique component of size between 
^(1 — j)n and 2(1 — j)n and none larger, then 

F {\H\ = m, \Ci\ - \H\ = na, \E{H)\ - \H\ = I 5) 



V^det(^^) + 0(1) jn 
(27rn)3/2 ^""P^ 2^' 



exp(-ix^A„x) , (5.4) 



uniformly for all {ni,n2,n^) G such that 

{Kp{l, l)-'/\uKp{2, 2)-i/2x2, Kp{3, 3)-i/2x3) 
is bounded, where = {xi,X2,X3) is defined by 

x'^ = -^(ni - bin, n2 - 62/1, - b^n) . (5.5) 
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By CLT it is clear that w.h.p. the total number of edges in G ~ Q{n,p) is 
(1 + o(l))An/2. Furthermore, by results of |8] and [T7] (see also [H]), w.h.p. 
our graph G has a unique giant component of size (1 + o(l))(l — fj,/X)n. 
Altogether, we deduce that the event B happens w.h.p.; assume therefore 
that B indeed occurs. Recalling (|5.5p where (xi, X2, X3) is given as a function 
of (ni, 77,2,71.3), define the event Q by 

Qm = {(m,n2,n3) G : |xi| < M for i = 1,2,3} , 

Q = {m,\Ci\ - \H\, \E{H)\ - \H\) e Qm} • 

By part (ji|) of Theorem 15.11 for any fixed 5 > there exists some M > 
such that P((5^) < S for a sufficiently large n. Next, define 

Pmax = max P {\H\ =ni, \Ci\ - \H\ = na, \E{H)\ - \H\ = m) , 

(rai,n2,n3)GQA/ 

Pmin= min P(|F| =ni, =n2, |^(//)| -|//| =n3) . 

(rai,n2,n3)GQA/ 

It follows from part ^ of Theorem 1 5 . 1 1 that there exists some c = c(M) > 
such that 

when n is sufficiently large. Notice that by definition of x, 

#{n2 G N : \x2\ < M} > M^/R. 
Combined with (j5.6p . it follows that for any (ni,n2,n3) G Qm we have 

P(|Ci| =ni + n2 , g| =ni) < — (5.7) 

With this estimate for P(|Ci| = N | H), the numerator in the right-hand-side 
of ()5.3p . it remains to estimate the denominator, P(|Ci| = | H). 

Recall that, given H, the quantity \Ci\ is a sum of \H\ i.i.d. Borel(;u) 
random variables (each such variable is the size of a PGW(//)-tree). We 
now wish to derive a local limit theorem for \Ci\, to which end we again 
apply Theorem 12.21 It is well known (and easy to show, e.g. [11^ Claim 4.2]) 
that a Borel(//) variable has expectation 1/(1 — fi) and variance /u/(l — /u)^ 
for any < /x < 1. Recalling the definition of Qm, we are interested in the 
following range for ni and 77-2: 

ni = {1 - - l^)n + ciV^ (|ci|<M), 
712 = m(1 - ^)n + C2^/^ {\C2\<M). 

Applying the local CLT now implies that for some 6' > 0, 

F{\Ci\=ni + n2\\H\=ni)>6'/,/n. (5.8) 
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Combining (|5.7p and (15. Sp . we obtain that when n is sufficiently large, 



F{\Ci\=N , Q\ 




p(|Ci| =n\ 




1 



By (|5.3p (and recalling that conditioned on \Ti\ the tree Tj is uniformly 
distributed among all unlabeled trees of this size, and similarly for Tj), we 
conclude that for some c' = c'{M) > and any unlabeled graph A 

¥{ei=A,Q,B\H)<c'¥{Ci = A\H). (5.9) 

We are now ready to conclude the proof of the main theorem. Let Ci be 
defined as in Theorem [TJ For any set of simple graphs A, define 

H = |if :P(Ci eA,Q,B\ Cf^ =H)> (P(Ci e A))^/'^^ . (5.10) 

Recall that by definition, Ci is produced by first constructing its 2-core 
(first two steps of the description), then attaching to each of its vertices 
independent PGW(/i)-trees. Hence, for any H, the graphs Ci and Ci have 

^2) ~(2) 

the same conditional distribution given C\ = C\ = H. It then follows 
from (|5.9p . (j5.10p that for some constant c" > and any H £ Ti, 

P(Ci G A I Cf ^ =H)> c"(P(Ci G ^))i/2 _ 

Since we have 

p(Ci eA)> c"(p(Ci G ^))^/¥(cf ^ G n) , 

the assumption that P(Ci G A) ^ gives that P(cf ^ G 7^) 0. 

At this point, we combine all the contiguity results thus far to claim that, 
for any family of simple graphs 

P(cf ^ G -F) = o(l) implies that P(cf ^ G -7^) = o(l) . 

Indeed, by definition, the 2-core of Ci is precisely the Poisson-geometric 
model, conditioned on the sum of the degrees d^^t -Dul_D„>3) being even. 
Thus, as J- has only simple graphs, clearly we may consider the model con- 
ditioned on producing a simple graph and in particular that J2u Du^Du>3 is 
even. Applying Theorem 14.21 (contiguity with Poisson-configuration) , Theo- 
rem [3?2] (contiguity with Poisson-cloning) and Theorem 12. II (contiguitv with 
Erdos-Renyi graphs), in that order, now gives the above statement. 

(2) 

This fact and the arguments above now give that P(Cj^ G Ti) — )• 0. By 
the definition of Ti, we now conclude that 

P(Ci G ^) < F{B^) + P(Q'=) + P(cf ^ £n) + (P(Ci G ^))i/2 , 

where the last term converges to by assumption. Taking a limit, we get that 
lim sup„_^oQ P(Ci £ A) < 6 and the proof is completed by letting 6^0. ■ 
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